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Abstract 

Price range contains important information about the asset volatility, and has long been considered 
an important indicator for it. In this paper, we propose to jointly model the [low, high] price range as 
a random interval and introduce an interval-valued GARCH (Int-GARCH) model for the corresponding 
[low, high] return range process. Model properties are presented under the general framework of random 
sets, and the parameters are estimated by a metric-based conditional least squares (CLS) method. Our 
empirical analysis of the daily return range data of Dow Jones component stocks yields very interesting 
results. 


1 Introduction 


Assets volatility plays an essential role in modern finance. It provides a measure of variability for the 
asset price over a certain period of time, and is a key parameter in many financial applications such as 
financial derivatives pricing, risk assessment, and portfolio management. Because volatility is not observable, 
statistical modeling that produces accurate volatility estimation and prediction is of great assistance. The 
squared return of log prices, being the classical estimator of variance, used to be the “ideal” proxy of volatility. 
As a result, many volatility models built on returns were proposed and, for a long time, had been very 
popular and successful. The celebrated ARCH (Engle 1982) and GARCH (Bollerslev 1986) models are 
examples of this type. Recently, as the high-frequency transaction data become available, price changes 
can be practically monitored in a continuous way, and the traditional low-frequency (e.g., daily) return 
is no longer quite representative of the volatility. For example, a small return does not necessarily imply 
low volatility, as the price may fluctuate a lot and close at a similar level to the opening. On the other 
hand, a big return could only be the result of a very different opening price from the previous day’s closing. 
Therefore, the return-based models, using too little information, are likely to produce inefficient or even 
incorrect estimates of the volatility. 

In fact, since closing price is only a “snapshot” among numerous prices during a day, there is nothing 
special about it and return needs not to be calculated solely from it. With the availability of high-frequency 
data, intuitively, the log difference between any two observed prices in two consecutive days can be called a 
“return”. This idea leads to a naturally generalized concept of daily return, which is an interval that includes 
all the “snapshot” returns. Let yt (s) be the log price of an asset at time s on day t. Ideally s should be a 
continuous time index. But since price can only be observed at discrete times even for high-frequency data, 
s is assumed to be a discrete index here. We define the interval-valued daily return as 


n 


min {y t (s) - y t -i (w)} , max {y t (s) - y t _i ( w )} 

s,w s,w 

min{y t (s)} - max {yt-i (ic)} , max{y t (s)} - min{y t _i (u>)} 
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Namely, ry is the range of “snapshot” returns during one day, which, apparently, contains more information 
about the daily volatility than the traditional closing-to-closing return. Our goal is to build a volatility 
model that reveals the dynamics of r t as an interval. 

Throughout the rest of the paper, we will view r t as a random interval and model its dynamics under 
the framework of random sets. To facilitate our presentation, we now briefly introduce the basic notations 
and definitions in the random set theory. (See, e.g., Kendall 1974, Matheron 1975, Artstein and Vitale 1975, 
Molchanov 2005, Sun and Ralescu 2014.) Let (17, C, P) be a probability space. Denote by 1C (R d ) or 1C the 
collection of all non-empty compact subsets of R d . In the space 1C, a linear structure is defined by Minkowski 
addition and scalar multiplication, i.e., 


A -\- B — (A T b \ cl £ A, b £ B ]-, AA — (An : a £ A} , 


( 2 ) 


\/A,B £ 1C and A £ R. A natural metric for the linear space 1C is the Hausdorff metric pn, which is defined 
as 

Ph (A, B) = max I sup p (a, B ), sup 

\aeA beB 

where p denotes the Euclidean metric. A random compact set is a Borel measurable function A : 17 — > 1C, 
1C being equipped with the Borel u-algebra induced by the Hausdorff metric. For each A £ 1C (R d ), the 
function defined on the unit sphere 1 : 

(u) = sup ( u , a), \/u £ S ,d_1 , 

aeA 


P (b, A) I , VA, B £ 1C, 


is called the support function of A. If A(w) is convex almost surely, then A is called a random compact 
convex set. Much of the random sets theory has focused on compact convex sets via their support functions. 
Especially, a one-dimensional random compact convex set is called a random interval. 

Under the linear structure ([2]), the random interval r t can be alternatively defined as 


n 


min {y t (s)} , max {y t (s)} 


min {yt-i (w)} , max{i/ t _i (u>)} 


(3) 


Namely, rt can be viewed as the return of daily price ranges. An immediate observation from equation (j3| is 
\r t \= min {y t (s)} , max {y t (s)} + min {y t -i («;)}, max {y t -i (w)} , (4) 


S 


L W 
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where | • | denotes the Lebesgue measure or the length of an interval. That is, the length of r t is the sum of the 
high-low log price ranges of the two corresponding days. Feller (1951) derived the asymptotic distribution 
of the range of a Wiener process. This, together with the classical stochastic volatility models, provides 
the theoretical evidence that the log price range contains rich information of the integrated variance, and 
consequently the daily volatility too. It in turn explains from another perspective that r t is an invaluable 
resource for estimating the daily volatiity. 

There has been a great deal of effort in the literature on volatility modeling using the high-low (log) price 
range of either low-frequency or high-frequency data (Garman and Klass 1980, Parkinson 1980, Rogers and 
Satchell 1991, Kunitomo 1992, Alizadel et al. 2002, Chou 2005, Engle and Gallo 2006, Brandt and Jones 
2006, Christensen and Podolskij 2007). One common characteristic of the existing methods is that they 
consider the high-low price range as an indicator of the volatility and include it explicitly into the modeling 
by functions of certain forms. In this paper we take a different approach. We focus on the interval-valued [low, 
high] return range as opposed to the point-valued high-low price range. It is seen from the above discussion 
that the information from the high-low price range is already contained in the return range. In addition, 
the return range also includes the return itself somewhere in the interval. Therefore, careful modeling of the 
[low, high] return range as a whole is expected to produce promising estimate of the volatility that accounts 
for information from both the return and the price range. To this end, we propose to model the return 
range process {rt} by developing a generalized interval-valued GARCH (Int-GARCH) model. Under this 
new framework, the classical point-valued GARCH process becomes the degenerate case of our Int-GARCH 
model, where the return range r* has zero length. Our theoretical results are two-fold. We first show that 
under certain conditions our Int-GARCH model achieves a weak stationarity that is characterized by a time 
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invariant mean and variance. Then, under the assumption of weak stationarity, we define and give the 
explicit formula of the autocorrelation function (ACF) of the Int-GARCH process. We propose a conditional 
least squares (CLS) method to estimate the model parameters, which is implemented by a Newton-Raphson 
algorithm. Simulation results are consistent with our theoretical findings and our CLS estimates are very 
satisfactory. 

For empirical study, we analyze the Dow Jones component stocks data using our proposed Int-GARCH 
model. It is hard to judge how well it estimates the volatility, but we manage to demonstrate the charac¬ 
teristics and advantages of our Int-GARCH via comparisons to both GARCH and realized volatility (RV). 
It is shown that Int-GARCH model, by implicitly utilizing the intraday data (i.e., daily price range), has 
the advantage of RV to reflect the intraday price variability, which the return-based models such as GARCH 
are usually insufficient for. The idea to embed high-frequency volatility measures such as RV (Andersen et 
al. 2001, Barndorff-Nielsen and Shephard 2002) and realized kernel (Barndorff-Nielsen et al. 2008) into low- 
frequency models is not new. For example, the GARCHX (Engle 2002), HEAVY (Shephard and Sheppard 
2010), and RealGARCH (Hansen et al. 2012) models are all important contributions along this direction. 
As certainly these compound models improve over the daily models with intraday information, their strong 
dependence on the realized measures makes them sensitive to any uncertainty or bias in these measures. 
For instance, it is well known that RV suffers from microstructure noises (see, e.g., Hansen and Lunde 2006, 
Brandi and Russell 2006, 2008). So, in order for such a compound model with RV to achieve its optimal 
performance, caution needs to be taken, either to choose the right sampling frequency or the appropriate 
remedy such as pre-averaging (Jacod et al. 2009) and subsampling (Zhang et al. 2004), to ensure that RV 
well measures the intraday volatility. Based on this discussion and the results of our empirical analysis, we 
believe that our Int-GARCH model makes the contribution of systematically integrating the low and high 
frequency volatility measures in an effective and operationally simple way, without obvious vulnerability to 
the microstructure noises. We present in our study that it is especially useful when assets are frequently 
traded and the intraday variation is significant. 

The rest of the paper is organized as follows. We formally introduce our Int-GARCH model in Section 2. 
Main theoretical results are presented in Section 3 and Section 4. Section 5 proposes the conditional least 
squares method for estimating model parameters and carefully investigates its performances by a simulation 
study. Empirical study with the Dow Jones stocks data, as well as a detailed discussion, are reported in 
Section 6. We finish with concluding remarks in Section 7. Proofs and useful lemmas are deferred to the 
Appendix. 


2 The Int-GARCH model 

We assume observing an interval-valued time series of the form 

ft = [V — St., At + <$t], t = 1, 2, • • • ,T. 


That is, {A i }^ =1 and {St}J =1 are the associated center and radius processes, 

A practical example of we consider in this paper is the daily return 

the information set up to time t, i.e. 

T t = cr {r s : s < t} . 

We are concerned with the conditional variance of r t given J~ t -\. 

The GARCH model depicts the conditional variance of a point-valued return process as a linear func¬ 
tion of the past returns and variances. This was inspired by the fact that assets returns usually exhibit 
volatility clustering: large variations in prices tend to cluster together, resulting in separate dynamic and 
tranquil periods of the market. Extending this spirit to the interval-valued process {rt}, one would expect, 
conceptually, a model such like 

H?=g{H 2 s y s -.s<t- 1), (5) 

where Hf denotes the conditional variance of rt given Tt- 1, and g is linear in H% and r%. We point out 
that Tg in ([ 5 ]) is only a notation, representing an estimate of the past variance. Actually, the square of 
a set has not been formally defined in the literature yet. To realize such a model, we need to have a 


both of which are observable, 
range process. Let Tt denote 


3 


mathematical definition for the conditional variance of a random interval r t and explicitly express it in terms 
of the observable random variables A t and S t . 

The variance of a compact convex random set was originally introduced by Lyashenko (1982) and further 
studied in Korner (1995). Applying these results to the random interval r t , via straightforward calculations, 
we obtain 


E(r t ) = [E(A t ) — E(<5 4 ), E(A t ) + E(<5 t )], (6) 

Var (rt) = Var(A t ) + Var((5 t ). (7) 


The interpretation of Q is obvious: the variance of a random interval consists of the variances from both 
the center and the radius. Considering ([7]) and assuming EAf = 0, a reasonable function g in ([5]) seems to 
imply 


Ht ~ P + a i ^t-i + $t-i ~ (ESt-i)' 


i= 1 


*=l 


where p > 0, q > 0. Since the unconditional mean of St is a constant and therefore can be absorbed into the 
parameter p, the above equation is simplified to 


Hf 


p q 

M + X/ ai t-i + $t-i\ + 

2=1 2—1 


To give more flexibility to our model, we allow for different degrees of dependence of H t on the past centers 
and radii. In addition, we propose to model volatility H t directly, instead of via the conditional variance H^. 

Given the above discussion, we propose the following Int-GARCH ( p , q , w) model for the return range 
process: 


n 

= h t ■ v t , 

(8) 

Vt 

= [et~Vt,et+Vt], 

(9) 


~ W(0,1), 

(10) 

Vt 

~ r(M), 

p q w 

(11) 

h t 

= p + ctj|At-ii + y ] pjSt-j + 7 tht-t, 

(12) 


2 — 1 2=1 2=1 


where p > 0 , q > 0 , w > 0 , and {at : i = 1 , • • • ,p} , {Pi : i = 1 , • • • q} , { 7 * : i = 1 , • • • , te} are positive con¬ 
stants. In (| 8 |, denotes the scalar multiplication. Although we pose parametric assumptions on the error 
terms e t and r] t to simplify our presentation here, they are not really necessary. In practice, it is best to 
use the true data generating distributions, which vary from data to data. So, in replacement of (10)-(lll, a 
relaxed yet sufficient specification for e t and is 


E {e t \r t \) = 0, 

(13) 

E {vt\F t '\) = k. 

(14) 

Vt > 0. 

(15) 


We can get some insights into this interval-valued model by breaking it down into point-valued models. For 
example, let hl t denote the high-low log price range on day t. Then, in view of it is easily derived from 
the Int-GARCH model that 

hit hit —1 — 2/i t ?y t , 


or equivalently, 


hit = 2h t r]t - hl t -1 - 


Separately, denoting by Wt the close-to-close return on day t, we have 


(16) 


w t = h t (e t + apt ), 


(17) 
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for some a £ [—1,1]. The reduced model ( |T6| )- implies 

Vax (hl t \T t -\) = kkh 2 , 

Var (wt\Et-i) = (l + a 2 k) h 2 . 

The return wt can be anywhere in the return range ry, so the exact value of a in the above equations is not 
known. But it should not be important anyway, as a single close-to-close return is not necessarily informative 
about the volatility. The idea of Int-GARCH is to combine all these factors and calculate the conditional 
variance of the whole interval r t as 

H 2 = Var (h t e t ) + Var (h t r) t ) = h 2 (k + 1) oc h 2 , 


where H t is used as the estimated volatility. Finally, we notice that when k —> 0, the radius St 0, and the 
model (|8])-(12l reduces to the usual point-valued GARCH model, except that the linear dependence on the 
past is specified for the conditional standard deviation. 


3 Distribution of Int-GARCH (1,1,1) 

Similar to the GARCH model, the Int-GARCH(1,1,1) process is the simplest but most often an effective 
model for analyzing interval-valued time series with conditional heteroskedasticity. In this section, we derive 
several important distribution properties of Int-GARCH (1,1,1). Before we present our theoretical results, 
we first notice that for the Int-GARCH (1,1,1) process, 

ht. = p + cki | A*—1 1 + pSt.-i + "fiht-i 

= p + ai\e t -i\h t -i +/3r)t-ih t -i +7i^t-i 
= P + (c*i|e t _i| + /3r) t -i +7i)/i t _i. 

Therefore, defining the i.i.d. random variables x t = cki| e*_ 1 1 + + 7i, t £ N, h t can be re-written as 


h t = p + x t h t - 1 . 


We will use (18) throughout this section to simplify notations. 


(18) 


3.1 Weak stationarity 

It is derived in Korner (1995) that the covariance between two random intervals is the sum of the covariances 
between the two centers and two radii. This implies 

Cov(r t , r s ) = Cov(A t , A s ) + Cov(S t , S s ), s,t £ N. (19) 


We are ready to extend the notion of weak stationarity to interval-valued time series in the obvious way. 


Definition 1 . An interval-valued time series {r t } is said to be weakly stationary, or second-moment sta¬ 
tionary, if its unconditional mean E(rt ) and covariance Cov (rt,rt+ s ) exist and are independent of time t 
for all integers s, where E(r t ) and Cov(rt,rt+ s ) are given in © and respectively. 

The existences of E (r t ) and Var (r t ) are closely related to the those of E (ht) and E (h 2 ), respectively. 
In fact, Eh 2 < oo implies the existences of the first two moments of r t . We give precise conditions in the 
following two theorems. 


Theorem 1. Consider the Int-GARCH model © -\1 with p = q = w = 1. 
infinite past with a finite mean. Then, Eh t < oo if and only if Ex t < oo, i.e. 


Assume {r t } starts from its 


ai 



+ /3ifc + 7i < 1. 
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When this condition is satisfied, 


( 20 ) 


Eh, = 


1 - an^/2/Tr- k - 7i' 


and 


Er t = [-kE (h t ), kE (h t )]. 


( 21 ) 


Theorem 2. Consider the Int-GARCH(1,1,1) model {r t } as in Theorem [7] Assume {r t } starts from its 
infinite past with a finite variance. Then E (h 2 ) < oo if and only if E (x 2 ) < E ( xt ) < 1, i.e. 


(*) ai\ ~ + Pik + ji < 1 ; 


(ii) 


/ 2 l~2 

-k + 2ai7i 1 / —I- 2/3i7ifc 
7r \ tt 


< 1. 


a i\/ ¥ + ftfc + 71 


When these conditions are satisfied, 


E (h^) = W 


C\ + 1 


(C2-I) (C i-l)’ 


( 22 ) 


and 


Var (r t ) = (l + k + k 2 ) E (h 2 ) - k 2 [E (h t )] 2 , 


(23) 


where E (ht) is given in (27), and C\ 


E(x t ), C 2 = E (x 2 ). 


So far we have found equivalent conditions for the existence of a time-invariant (unconditional) mean 
and variance. In order to guarantee weak stationarity, by definition, we still need to find conditions under 
which the (unconditional) covariances are finite and time-invariant. According to the following Theorem [ 3 ] 
these conditions turn out to be the same as those for the existence of variance. This is not surprising as 


I Co v(r t ,r t+h ) | < \Cov (X t , X t +h) \ + |Cov (S t , S t+h ) \ < Var (A t ) + Var (S t ), 


and, for a time series model with specified recursive structure, the existence of any unconditional moment 
usually implies time-invariance too. We summarize this conclusion in Corollary following Theorem [3j 

Theorem 3. Consider the Int-GARCH(1,1,1) process {r t }. Under the assumptions of Theorem [^[ the 
covariance of any two random intervals r t and rt+ s is given by 


Cov (r t ,r t + s ) 


(1 + k + k 2 ) E (h 2 ) ~ k 2 [E (ht)} 2 , s = 0; 
kE (h t h t+s rjt) - k 2 [E (h t )] 2 , \s\ > 0, 


where E (ht) and E (h 2 ) are given in (27) and (22), respectively, and E (htht+sPt) is calculated explicitly in 
Lemma [7] (see Appendix). 


Corollary 1. The Int-GARCH! 1,1,1) process is weakly stationary, or second-moment stationary, if and 
only if E (x 2 ) < E (x t ) < 1. 


3.2 Auto-correlation function (ACF) 

The notion of the variance and covariance for compact convex random sets were naturally extended to the 
correlation coefficient of two random sets A and B, which is defined as 


Corr (A, B) 


Cov (A, B) 
-y/Var (A) Varflfy' 


(24) 


Based on this definition, we calculate the auto-correlation function (ACF) of the Int-GARCH(1,1,1) process 
and give the result in the corollary below. 


6 










Corollary 2. 

process {r t } is 


Under the assumptions of Theorem [^J the auto-correlation function of the Int-GARCH( 1,1,1) 

fl, s = 0 

P(s) = l kE (h t h t+sVt ) - k 2 [E(h t )] 2 Q 

[ (1 + k + k 2 ) E (h 2 t ) - k' 2 [E {h t )f ’ 


We plot the ACF for a specific Int-GARCH (1,1,1) model (Model I in the simulation) in Figure 1. We see 
that the centers are uncorrelated. This has been verified by (381 in the proof of Theorem [3] (We will elaborate 
more on this in the subsequent section.) The radii, or the lengths of the intervals, have a relatively persistent 
auto-correlation, which coincides with the phenomenon of “volatility clustering”. This long-term dependence 
of radii carries over to the intervals as a whole, and results in a slow-dying ACF of the interval-valued process. 



Figure 1: Theoretical auto-correlation function of Model I. 


4 The general Int-GARCH (p,q,w) 


4.1 Mean stationarity 


We provide the necessary and sufficient conditions of mean stationarity for the general Int-GARCH (p,q,w) 
model in the following theorem. 

Theorem 4. Consider the general Int-GARCH model |ff|)-|.?,g[). Define 

x i,t = Oii\tt\I{l<i<p} + Pi\vt\l{l<i<q} + (25) 


and 

E(X itt ) = m, (26) 

where i = 1, 2, •• • ,k = max{p, q,w}. Assume {rj} starts from its infinite past with a finite mean. Then, 
Eht < oo if and only ifz2i=iPi < 1- When this condition is satisfied, 


Eh t 


1 ~ 2_/i=l Ti 


(27) 
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and 


Er t = [-kE (h t ), kE ( h t )]. 


(28) 


4.2 Relationship to GARCH 


It is well known that the ARCH/GARCH types of models are heteroskedastic but serially uncorrelated. 
From the proof of Theorem [3] the centers in the Int-GARCH(1,1,1) model are also uncorrelated. And, 
in general, the centers in any Int-GARCH(p,q,w) model are uncorrelated. This is not a coincidence. By 
noticing X t = etht, we see that ht is the conditional standard deviation of the center A t- The GARCH(p,q) 
process models h t by 

p q 

h t=h + Yl a * X t-i + Pi h t-n P >0 ’ ? > 0. (29) 

2=1 2=0 


Compared to this, except for using h t directly instead of hq, our Int-GARCH(p,q,w) specification for h t (12) 
has only one extra term, which is a linear combination of the past radii {5t-i : i = 1, • • • , q}. Therefore, the 
center of our Int-GARCH model is just a GARCH process with one external variable that is the half-range 
of the price. Our Int-GARCH process as a whole further systematically models the interaction between the 
average and the range of prices under the random sets framework. 


5 The conditional least squares (CLS) estimate 


We propose a conditional least squares method to estimate the parameters of the Int-GARCH model. 
Throughout this section, we denote by 9 the parameter vector that contains all the Int-GARCH param¬ 
eters {k\ /i; a\, ■ ■ ■ , ,{3 q ; ,7™} as its components. Conditioning on the past, the expected 

value of the current interval r t is 

f t = E[r t \ E t -i] 

= E{[h t (£ t -r]t),ht(£t + r] t )]\J r t-i} 

= [E {h t (et — Vt) \E t— 1 } > E {h t (e t + rjt) \T t-i}] 

= [—kht,kht], 


where 

p q w 

ht=n+Q-i i a*—* i+ PiSt-i+ 7 1^*-*• 
i=1 i= 1 i=0 

A widely used metric in the space ICc of compact convex subsets of R d is given by 


(30) 


S(A,B) = 


I 


|s a (m) - s B (u)\ 2 fi (d u) 


(31) 


where A 7 B £ ICc and ^ is the normalized Lebesgue measure on S d 1 (see, e.g., Korner 1995). Letting 
A = r t , B = f t in (311, we have 

$ (fti ft )2 = Tj | [At — 5t + kht] + [A* + St — kht] |, 

Our CLS estimate 9 is defined such that the sum of the squared ^-metric between r t and r t is minimized, i.e. 

t 'i 

(32) 


9 = argmin < y^ ||rt — [1 2 r := argnnn 

e e 


in {4 ( 6 *}} . 


Notice that the recursive formula in (30) requires starting values ho and tq. We assume that the process {r t } 
starts from its infinite past with a finite mean and variance, and therefore it is reasonable to let ho = E (ht). 
An alternative is to let ho = 0, assuming {r*} starts from a constant interval tq. Based on our experience, 







the choice of ho does not affect the accuracy of the final estimate, given a reasonably large sample. In either 
case, we let ro = E (r t ). 

In principle, an estimator that makes full use of the distribution information such as the maximum 
likelihood estimate (MLE) is more desired, especially in the situation with heteroskedasticity. However, the 
(conditional) likelihood for the Int-GARCH model involves distribution functions of both e t and rj t , and so the 
(conditional) MLE is computationally much more expensive than the CLS. As will be seen in the empirical 
analysis section, it is the interval length that plays the dominant role in the model. This implies that the 
distribution information of 7 / 4 , which is largely reflected on the value of k, is dominating that of e t . By 
switching from conditional MLE to CLS, we lose the minor information of et, but our gain of computational 
efficiency is substantial. In the rest of the section, we will restrict our attention to the Int-GARCH(1,1,1) 
model. We first give explicit formulae for calculating the CLS estimate using a Newton-Raphson algorithm. 
This necessitates the computation of initial parameters. An initialization scheme is provided based on the 
method of moments. Finally, we carry out a simulation study to examine the performance of our proposed 
CLS method, and the results are very satisfactory. 


5.1 A Newton-Raphson algorithm 


The parameter vector in the Int-GARCH(1,1,1) model is 9 = [fc,/r, a±, /3i, 71 ] . Plugging (30 1 in (32 1 , the 
conditional least squares function L ( 6 ) is explicitly expressed as 




|[At — St + k (fi + ai |A t _i| + /3i6 t -i + 7i^t-i)] 

t=1 

+ [At + St — k (p. + ai |A t _i| + PiSt-i + 7i/if_i)] |. 


Consequently, the gradient vector L and the Hessian matrix L" ( 9 ) are found to be 



and 


L"(i n =2 el 


fc 2 |At_! 


k 2 S t -1 


k 2 h t -1 


A 2 |A t _i| k 2 |A t _i| k 2 S t -i\Xt-i\ k 2 h t -i\\t-i\ 


k 2 S t -i k 2 |At_i| <5 *—1 


k 2 S 2 .1 


k 2 h t _i5 t -i 

k 2 h 2 t _ 1 


(33) 


k 2 h t -1 k 2 \X t -i\h t -i k 2 S t -ih t -i 
The Newton-Raphson Algorithm thus consists of an iterative computation of the following formula 


Q( k + 1 ) = Q( k ) _ 


L 


■ L’ 


where V 


and 


L" (e^ 1 are the gradient vector and inverse Hessian matrix evaluated at the k th 


step estimate 9 respectively. 


5.2 Parameter initialization 

We use the method of moments to get an initial estimate of 6. Notice that 


E(6 t ) 

= E (h t 6 t ) = E (h t ) E (S t ) = kE (h t ), 

(34) 

E A* 

= E\h t et\=E(h t )E\e t \ = y/2frE(h t ), 

(35) 

E(h t ) 

h 

1 - ai y/2/n - fak - 71 

(36) 
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Equating E |A t | with its sample mean |A*| in (351, we get the moment estimate of E{ht). That is, 


E(h t ) = \l-\Xt\. 


Then, replacing E ( ht ) by E ( h t ) and equating E (<5 t ) with its sample mean |<$ t | in (341, we obtain an initial 


estimate of k. Similarly, /i is initialized by replacing E (h t ) by E (h t ) and a rough guessing (for example, 0.4) 
of 1 — a\ y/2/TT — (3\ k — 71 in |36|. Finally, initial estimates of «i, /3i, and 71 are obtained by setting each of 
a ly /2fr, /3ifcj and 71 to be a small value, which we choose to be 0.2. In conclusion, our initial parameters 
are given by 


fc u = 


h = 


«? 


@1 = 


7i 


2 St_ 


0.4 

0.2 

0.2 

0 . 2 . 


i |At 


5.3 Simulation 

We generate four sets of parameters using the initial values ho = 0 and ro = E (rt), each of which will result 
in a weakly stationary Int-GARCH process. The exact parameter values are listed in Table [T] Plots of the 
simulated data are shown in Fig [2} 


Model I Simulated 



Model II Simulated 



0 200 400 600 800 

1000 0 200 400 600 800 

Figure 2: Plots of simulated data sets each with T = 1000. 

Denote 


70) 

= Co v(r t ,r t+s ), 

7a(s) 

= Cov (A t , A t+S ), 

is (s) 

= Cov (S t , S t +s) ■ 

Recall that the theoretical ACF of {r t } is 


p(s) = 

7(s) _ 7a (s) + 7s(s) 

7(0) 7a( 0)+7a(0)' 


We consequently define the sample ACF of {r t } as 


P(s) = 


ix{s)+js{s) 
7a(0) + 7~<5(0) ’ 
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where 'fx(s) and 'fs(s) are the sample auto-covariance functions of {A*} and {<5*}, respectively. Figure [3] and 
[4] show the sample ACF’s for simulated data sets with 3000 observations from models I and II, respectively. 


Autocorrelation Function of Centers Autocorrelation Function of Half Lengths 




Autocorrelation Function of Process 



Figure 3: Sample auto-correlation functions of a simulated data set from Model I. 

For each of the four models, we simulate a data set with 3000 observations and calculate the CLSE of 
the parameters using the Newton-Raphson algorithm. We found from our experience that excluding k from 
the algorithm to search for the CLSE for {/x, aq, /3i, 71 } results in the algorithm being more stable. Since the 
initial estimate for k as described in Section [572] turns out to be good enough, it is used as the final estimate. 
An alternative strategy is to update k and the rest parameters in separate steps. We repeat the process of 
simulation and estimation for 100 times independently, and the overall results are reported in Table [l] We 
see that our conditional least squares estimates are very close to the true parameters with small empirical 
standard errors. 


6 Real data analysis 

In this section, we apply our Int-GARCH model to analyze the Dow Jones Industrial Average Index com¬ 
ponent stocks. We obtained the daily data of totally 3019 trading days, from January 3, 2000 to December 
31, 2011. Figure [5] shows plots of the return range data for 4 randomly selected stocks: BA, JPM, KO, and 
TRV. To reveal more information in these data, we highlight the days when the return range is long (above 
75% quantile of the entire data) but the average return is small (absolute value of the center is below 25% 
quantile). We see that most of such days occurred in the years 2000-2004 and 2008-2010. As we discussed 
before, these days show large variability in price, but the point-valued volatility models such as GARCH 
tend to underestimate it. The sample ACF of the BA return range is displayed in Figure [ 6 ] It is very 
similar to the theoretical ACF of model I in Figure [I] from Section |5.3| which indicates the feasibility of 
our Int-GARCH model. We estimate the parameters by the CLS method we proposed in Section [5] and 
list the results for 10 randomly selected stocks in Table [2j The fitted models show some patterns. 1) oq is 
much smaller than in magnitude, indicating that the return range is of much more importance than a 
“snapshot" return, in terms of their contributions to the volatility. 2 ) 71 is either very small or exactly 0 ; 
most likely an Int-ARCH model is sufficient here. 
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Autocorrelation Function of Centers 


Autocorrelation Function of Half Lengths 





Figure 4: Sample auto-correlation functions of a simulated data set from Model II. 


As we mentioned above, most of the trading days with large return ranges but small average returns 
(i.e., around the years 2002 and 2008) happened during the periods of market crash. This is consistent with 
one’s intuition: investors tend to behave irrationally and take swift actions frequently in the time of crisis, 
resulting in unusually large variability of the price. We are particularly interested in these periods when 
GARCH and our Int-GARCH models tend to differ significantly. In figure [7J we plot the estimated volatility 
based on both models for two randomly selected stocks for each of the two periods. They are quite consistent 
in terms of the overall trend, but our Int-GARCH curve shows much more detailed fluctuations than the 
smoother GARCH curve. And, just as we anticipated, the GARCH-based volatility is generally smaller than 
our Int-GARCH-based volatility. 

It is hard to say which estimate is better in the plots of Figure [7J In principle, the quality of volatility 
modeling is only judged by its usage in the financial applications. So our main purpose here is to analyze 
and interpret the unique characteristics of our Int-GARCH process, so as to provide practical guidance. To 
this end, we further compare our Int-GARCH estimate to the realized volatility (RV) with 5-min sampling 
frequency. Both of these two methods use intraday data to estimate volatility, but they have subtle differ¬ 
ences. RV is a model free estimator, and possesses very nice theoretical properties such as strong consistency 
(e.g., Andersen et al. 2001, Barndorff-Nielsen and Shephard 2002). It uses all or a large portion of the 
intraday transaction data, and therefore is expected to have a “fair" reflection of the intraday price variabil¬ 
ity. However, because of the microstructure noises, it usually cannot achieve its optimal performance. For 
example, it is generally believed that RV with moderate sampling frequency (e.g., 5-min) has a downward 
bias (e.g., Bandi and Russell 2006, 2008). Our Int-GARCH model reads in the intraday information through 
the high-low range. As we discussed before, although range is only a summary statistics, it does contain 
rich information about the intraday activities. So Int-GARCH model is expected to inherit to a considerable 
degree the advantage of RV -revealing detailed intraday variability. What is important, it is robust to the 
microstructure noises, thereby avoiding the intrinsic (downward) bias associated with RV. 

We display the comparative plots of Int-GARCH and RV for two stocks, AXP and WMT, for the period 
1/2/2000-12/31/2001, in Figure [8] The two estimates show very similar fluctuation patterns, the RV being 
smaller than the Int-GARCH in general most likely due to its downward bias. Particularly, we notice a few 
interesting trading dates from these plots to illustrate the advantages of Int-GARCH. On April 26, 2001, 
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Table 1: Average Result of 100 Repetitions 


Model 

Parameters 

Mean Estimate 

Mean Bias 

Empirical Standard Error 


k 

4.7162 

4.7173 

0.0677 

0.0832 


V 

0.4724 

0.4917 

0.0671 

0.0842 

I 

Oil 

0.2637 

0.2664 

0.0206 

0.0251 


Pi 

0.0906 

0.0887 

0.0055 

0.0063 


7i 

0.1796 

0.1778 

0.0383 

0.0475 


k 

2.7330 

2.7370 

0.0396 

0.0491 


A* 

0.1385 

0.1397 

0.0110 

0.0139 

II 

Oil 

0.2572 

0.2621 

0.0180 

0.0222 


Pi 

0.0202 

0.0192 

0.0059 

0.0073 


7i 

0.1459 

0.1398 

0.0521 

0.0651 


k 

5.4871 

5.5012 

0.0672 

0.0908 


V 

0.5331 

0.5359 

0.0453 

0.0574 

III 

Oil 

0.1782 

0.1751 

0.0127 

0.0154 


Pi 

0.0253 

0.0254 

0.0027 

0.0036 


7i 

0.1396 

0.1364 

0.0538 

0.0669 


k 

1.9108 

1.9103 

0.0286 

0.0358 


A* 

0.3640 

0.3654 

0.0384 

0.0458 

IV 

Oil 

0.2642 

0.2652 

0.0211 

0.0269 


Pi 

0.0228 

0.0216 

0.0083 

0.0101 


7i 

0.0705 

0.0704 

0.0745 

0.0884 


the RV of AXP shows a very large spike of $0,127, while the Int-GARCH and GARCH models give much 
smaller estimates of $0,031 and $0,035, respectively. The transaction data on that day reveals that the price 
mostly fluctuated around $41-42 for the whole day except for about half an hour in the afternoon it suddenly 
dropped to $38.43 but then quickly rose back to $41.90. The same situation happened to the WMT stock 
too on April 9, 2001, on which day the price suddenly soared from the stable range of $50-$51 to a much 
higher level of $82.5487 but only stayed there for 5 minutes. Such a lightening change of price usually is not 
due to the market volatility, so for this case RV is probably being too sensitive and giving a “false alarm”. 
Separately, another interesting phenomenon is observed for AXP stock on March 10, 2000 and for WMT 
stock on April 4, 2000, respectively. For these two cases, RV and Int-GARCH estimate are relatively close 
($0,059 and $0,048 for AXP, and $0,063 and $0,076 for WMT, respectively), but both are much higher than 
the GARCH estimate ($0,030 and $0,039 for AXP and WMT, respectively). To get insight into this, we 
plot the corresponding 5-min intraday prices for the entire days in Figure [9] In both plots, some significant 
fluctuations are shown in the later portion of the day, compared to which the closing-to-closing returns are 
quite small. So clearly the return-based GARCH model is underestimating the volatility here, and the RV 
and Int-GARCH estimate are more fair. There is yet another situation when Int-GARCH shows typical 
advantages. On January 22, 2001, the GARCH estimated volatility for the AXP stock is $0,037, while RV 
and Int-GARCH yield $0,072 and $0,074, respectively. We list the adjusted closing prices for that day as 
well as a few days nearby in Table [3] Notice that the return on January 22 is not really small, but the nearby 
returns are. So we reasonably conjecture that GARCH in this case is trapped in the “volatility clustering" 
mechanism and too rigid to capture the high volatility that is not in a “cluster". 

7 Conclusion 

The financial market today generates a huge amount of data every second, necessitating the development of 
new models and methods to analyze and take advantage of them. In this paper, we developed an interval¬ 
valued GARCH model for analyzing return range processes. It can be viewed as an extension of the point- 
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Daily Log Return Range of BA Stock Daily Log Return Range of JPM Stock 





Figure 5: Plots of the daily return range data for selected stocks from Dow Jones Index. 


valued GARCH model that allows for interval-valued returns. Yet its most important contribution lies in its 
capability to integrate realized measures and return-based model mechanism to produce “information-rich" 
estimation of the volatility. Inferences of our Int-GARCH model can be made based on the conditional 
least squares method we proposed, which was shown to have quite stable and reliable performances. Our 
empirical study of the Dow Jones stocks data demonstrated the advantage of Int-GARCH model: it enriches 
the low-frequency volatility model such as GARCH with the high-frequency information, without having 
to suffer from the microstructure noises. This is especially valuable during crash time, or in general when 
the market is unstable, because drastic intraday variation is hardly captured by the low-frequency return 
only. We compared the performance of Int-GARCH to those of GARCH and RV. It was further shown that 
Int-GARCH has improved model flexibility: the GARCH mechanism prevents it from being too sensitive to 
data like the model-free RV; the role of the intraday range also makes it less rigid than GARCH itself. 
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Table 2: Estimated Int-GARCH parameters for 10 selected stocks. 


Index 

Stock 

k 

Parameter Estimates 

P ai Pi 

7i 

Dow Jones 

AXP 

1.7524 

0.0016 

0.0635 

0.4884 

0.0072 


BA 

1.6828 

0.0025 

0.0686 

0.4687 

0 


BAC 

1.5909 

0.0017 

0.0762 

0.5337 

0.0084 


DD 

1.7744 

0.0019 

0.0209 

0.4817 

0 


JPM 

1.6873 

0.0018 

0.0379 

0.5195 

0.0021 


KO 

1.8150 

0.0013 

0.0383 

0.4620 

0 


MSFT 

1.7513 

0.0017 

0.0320 

0.4878 

0 


T 

1.8106 

0.0021 

0.0524 

0.4475 

0.0001 


TRV 

1.8167 

0.0015 

0.0092 

0.4877 

0 


WMT 

1.8415 

0.0015 

0.0376 

0.4590 

0 


Table 3: AXP daily adjusted closing prices for a selected time range. 


Date 

1/16/01 

1/17/01 

1/18/01 

1/19/01 

1/22/01 

1/23/01 

1/24/01 

1/25/01 

1/26/01 

Adj Close 

35.34 

36.16 

36.3 

35.57 

32.92 

34.02 

33.83 

33.92 

34.11 
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A Proofs 

A.l Proof of Theorem [Q 

Proof. We notice, by recursive calculations, that 
h t = n + xt-iht-i 

= n + x t -i (n + xt-zht-z) 

= ^ + IIXt-1 + Xt-lX t -2ht-2 

= Ll + /iXt-l + A lXt-lXt-2 H-1- HXt-lXt-2 ■ ■ ■ Xt-N 

+Xt-lXt.-2 ■ ■ ■ X t -(N+l)ht-(N+l ) 

( N i \ N +1 

i+E n Xt.-j J + h t _( N+ 1 ) x t -j, VN £ N. 

i=i 1 =1 J j =1 

Taking expectations on both sides, we get 

N i \ N+l 

En Xt-j J +/l(_(]V+l) Xt-j 
*= 11=1 / 1=1 

N 

= (ExtY + ( Ex t ) N+1 Eh t _( Af + i), 

i=0 

for all N £N. Letting N —> 00, 

OO 

Eh t = mE ( Ex tY + lim (Ex t ) N+1 Eh t _, N+l ) 

zJ N—>00 

i= 0 

00 

= mE (^) 1 

i=0 

h 

1-E(x t ) 

_ h _ 

1 - axy/ifi: - /3ik - 71’ 

since l-Ea^l < 1. On the other hand, if Ext > 1, then (Ext) 1 = 00. Therefore, Eht < 00 if and 

only if 

\Ex t \ = \E(ai\e t -i\+Ptrit-i+ji)] 

= ai\f2/-K + 0ik + 71 
< 1, 

and when this is satisfied, the Aumann expectation of r t is found to be 

Er t = E [h t {e t - r] t ), h t (e t + Vt)\ 

= [E (h t ) E (e t - r] t ), E (h t ) E (e t + T) t )] 

= [-kE (h t ), kE (h t )] ■ 


Eh t = E \ n\\ + 
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o. 


A.2 Proof of Theorem [2] 

Proof. Recall, from the proof of Theorem [T] that 

JV i 


JV+1 


Consequently, 


Note that 


and 


ht = (i 1 + £n Xt—j I + /i^_(jv+i) 11 xt—j , \fN G N. 


i= 1 


3 =1 


AT 2 


AT+1 


h 2 t = ( [l 1 + zn •Et—j I hi— (JV+1) 11 VCt—j 

\ '=1 J =1 / i=i 

' N i \ 2 / N i 

1 + zn I + 2/i I 1 + zn •Et—j I I hi_ (jv+i) |1 ^t—j 


JV+1 


= V 


i=l j=1 


i =1 f=1 


1=1 


JV+1 
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3 = 1 


AT 


JV+1 
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*=1 i=i / i=i 


JV+1 / JV 


JV+1 


+2/x/i t _(jv+i) H a:t_j I £ I + /i 2 -(jv+i) II 

3 =1 \*=i j =1 / i=i 


JV i 

1+£ n 

i=i j =1 

(1 + 0+-1 + Xt-iXt -2 + ' ' ' + Xi-iXt -2 ‘ ‘ ‘ Xt-N) 
N i JV i JV-1 

1 +£ n x t~ 3 + 2 £ n xt ~o + 2 £ 
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Therefore 
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Let C\ = E (x t ) and C 2 = E (x 2 ). By the independence of /i t _(jv+i) and Vj < TV +1 and the fact that 
{a+} are iid, we obtain, 
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The geometric series Yh=iCI an d YiL 1^1 converge if and only if \C\\ < 1 and \C 2 \ < 1 . For the fourth 
term in the above expression, 


iV —1 
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N-l 


= E 
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N- l , „ X i N- l 


,ViV £ N. 


This is a finite number when N —» 00 if and only if IC2I < |Ci|. Finally, 

JV N / \ i 

YciC ?~ i+1 = C'f^E (^) >VATeN 

is finite for TV —00 if and only if IC2I < \Ci\. Therefore, E (hf} < 00 if and only if \C 2 \ < \Ci\ < 1 , or 
C 2 < Ci < 1 , since both Ci and C 2 are positive. 

Under this assumption and in addition that EhY^ < 00, letting N —> 00, we find the second moment of 
h 2 to be 


E(h 2 t ) = /r 2 (l- 


_ o <?! , 9C ^ _ 1 

C 2 -l Ci-1 lG2 (C 2 -l)(Ci-l) 
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Ci + 1 


(C 2-1) (Cr — 1) ’ 

Consequently, the unconditional variance of r t is 

Var (r t ) = Var {h t £t) + Var (h t r]t) 

= (1 + k + k 2 ) E (h 2 ) - k 2 [E {h t )} 2 . 

This completes the proof. 

A.3 Proof of Theorem [3] 

Proof. First we notice, Vf, s £ N, 

r t = [h t £ t - h t T]t , h t e t + h t T]t ], 

^t-\-s = [ ht+s£t-\-s ht+s^lt+s 1 ht-\-s£t-\-s “i - ^t-\-s^]t-\-s\ ■ 

and therefore, 


Cov (r t , r t+a ) = Cov ( h t s t , h t+s e t+s ) + Cov (h t 'q t ,h t+s r] t+s ). 


The first term 


Co v(h t £t,h t + s £t+s) = 


E [(htSt — E ( htet )) (ht+s^t+s — E ( h t+s ct + s ))] 

\E{h 2 e 2 ), s = 0 

[E (htht+s^t) • E (et+s) j |s| > 0 

\e{e 2 ) -E{h 2 ), s = 0 
\ 0 , |s| > 0 

(E(h 2 ), 3 = 0 


0 , 


1*1 >0 


□ 


(37) 


(38) 
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since {e t } are i.i.d. 


Similarly, the second term becomes 
Cov (h t r] t , h t+s r] t+s ) 

= E [(h t r]t - kE ( h t )) (h t+s Vt+s - kE ( h t+s ))] 

= E (htht+sVtVt+s) ~ kE (h t ) E (h t+s r] t+s ) - kE (h t+s ) E (h t ij t ) + k 2 E ( h t ) E (h t+s ) 
= E (h t h t+s r] t r]t +s ) - kE ( h t ) kE (h t+s ) - kE ( h t+s ) kE (h t ) + (/i t ) E (h t+s ) 

= E (htht+sViVt+a) - k 2 E (h t ) E (h t+s ) 

f ^ ( h rV?) - fc 2 [-E (*t)] 2 . « = 0 

\E (hth t+s r]t) E (r] t+s ) - k 2 E (ht) E (h t+s ) , |s| > 0 

I (k + k 2 ) E (h 2 ) - k 2 [E (h t )] 2 , s = 0 
\kE(h t h t+sVt )-k 2 [E(h t )] 2 , |s| > 0. 


Plugging (38) and (391 into (137]), we obtain 


f(l + k + k 2 )E(h 2 t )-k 2 (E(ht)} 2 , s = 0 
t+S> \kE (htht+sVt) ~ k 2 [E (h t )}\ H>0, 


Wh6re E {ht) = T=Cl' E W = ^ (C 2 -1) + (C 1 -1) 


, and 


E (htht+sVt) = 


H 2 k ( C {-1 Cf + C- 


S — 1 
1 


ai\l ~ + Pi (1 + k) +7i 


Ci - 1 y Ci-1 C 2 - 1 

(see Lemma [lj Appendix). 

A.4 Proof of Theorem [4] 

Proof. By the definition of ( [25] ) , ( |l2| ) can be rewritten as 

p q w 

h t = + + y ^ Piijt-jht-i + ^7 jh t -i 


i— 1 
k 


i—1 


i—1 


— ft ^ ^ %i,t—jht—i • 


i— 1 


Expanding recursively, we obtain 


(39) 


□ 


ht — ft ^ ^ I H” ^ ^ ht—i—j 

i= 1 V i=l 

/ k \ k k 

= fl | 1 -|- ^ ^ I “ 1 “ ^ ^ ^ J ' j 3'i,t—j3'j 7 t—i—jht—i—j 

) i= 1 j=l 


2=1 


= ft 


N k k 

1 + EE'"E (II*ij 

n=l 2 i = 1 2 =1 Y7=l 


fc fe / AT+1 

+ E '' ' E ( IT -b ) k't—ii -IJV+1- 

*1=1 *„AT+1=1 \i = l 


(40) 
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Notice that x^t and Xj yS are independent, Vi, j € N and t ^ s. Taking expectations on both sizes of (401, we 
get 


E{h t ) = /i 


N k kin 

1 + E E • •• E iU 

n= 1 i\—l i n = 1 \J = 1 


< h 


< A 4 


k k / N+ 1 

' E ' ' ' E I IT ^ (ht-h -ijv+i) 

il=l 2JV + 1=1 \j = 1 

N / k 

1 + E E ^ 

n=l \j=l 

fc fc / iV+1 

E ••• E II /'<, ] inax {£ : iV + 1 < l < k{N + 1)} 

*1=1 *1V+1=1 \J = 1 

N ( k 

1 + E E w 3 - 


n=l \ j = l 


IV+1 


+ max{.E {h t _i) : N + 1 < l < k{N + 1)} | y 

U =1 


V N € N. Therefore, E (Jit) < oo if and only if 


E^ <L 


i=l 


When it is satisfied, 


E (ht) = lim /r 

N—> OO 


Nik 

1 + E I>. 

n=l \j=l 


AT+1 


+ lim max {A ( ht-i ) : AT + 1 < l < k(N + 1)1 I y^ Pi 

N— loo l z ' 

V =1 


1 7 

1 — zDi-i A 4 * 

by the finiteness of E (h_ oo). The formula for A (77) follows immediately from the Aumann expectation. □ 


A.5 Proof of Corollary [l] 

Proof. It is immediate from Theorem EU and 0 


□ 


A.6 Proof of Corollary [2] 


Proof. The Auto-correlation Function (ACF) of {r t } is defined to be p{s) = Corr (r t , rt+ a ) 


the ACF of {r t } is 


1, s = 0 

P(s) = < kE (h t ht + sV t ) - k 2 [E (ht)] 2 

[ (1 + k + k 2 ) E {h 2 t ) - k 2 [E {ht)} 2 ’ 


7(«) 

7(0)' 


Then 


□ 
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B Lemmas 


Lemma 1. 


(?) E {rj t x t ) =ai^j^-k + Pi (k + k' 2 ) + 7 ifc; 

fi 2 k ( C {-1 Cf + C- 


( H ) E (htht+sVt) = 


Proof, (i) 


s —1 
1 


Cl - 1 \ Cl - 1 C 2 - 1 


ol\ \ —h /3i (1 + /c) + 71 

7r 


= E [rjt • (07 |et| + /Ci? 7 t + 71 )] 

= aqE Ot|) • E ( r] t ) +Pi-E (? ?i 2 ) + 71 E (r) t ) 

= ai^— • k + /3i (k + k 2 ) + 71 fc 


(ii) First, we expand recursively: 


= /i + /iXt+ s _i + 27+s-l37+s-2^£+.s-2 


= /Z + /ZXt+s-l + M^t+s-l^+s-2 H- 

-\-fU,Xt-\-s—l%t-\-s—2 * * ‘ 37+1 H"~ ^7+s—l*£t+s—2 * * * %tht 
s—li \ s 

- Mi+zn •Kt+s—j I T h't | | ^t+5— j' 


*=1 i=i 


j=i 


Consequently, 


H s-i * \ s 

1 + En a.'t+. s -j + h t P Xt+s-j 

*=1 j=i / t=i 

S —1 Z S 

= + mM* En Xt.+s—j hfVt 11 Xf+s—j • 

*=1 i=i i=i 


Then the expected value is found to be 


E ( h t h t+s r ) t ) = fj,- E (h t ) ■ E (ry t ) + p • E (/i t ) • E (ry t ) • E 


+-® 1 (^t) ' P a-'t+s-i 

= /ifc • E (/i t ) -\- fik ■ E ( h t ) 

s-l 


s— 1 i 


e n (*t+«-j) 

*=1 i=i 


e n ^ ( x t+s-t) 

*=1 i=i 


+E (/i 2 ) • E (■ rj t xt ) • P E (aj t+s _j) 
j=i 

C! (CT 1 - 1) 


= /ifc • E (/i t ) + fxk • E (ht) ■ 
+E (h 2 ) ■ E (rjtxt) ■ C{-\ 


C\ 1 


23 



where C1 = E (x t ). 

Finally, remembering that E (hi) = /./, 


C + 1 

2 —- -—, the above calculation is simplified to 

(C-2 — 1) (Ci — 1) 


E {htht+aVt) = hk-- -— • 

t — c l 

+E{h 2 t )- 

1 


1 + 


c, (ci- 1 - 1) 
Cl -1 


= n k 


d. i • fc + Pi (k + A 2 ) + 7!A; 
Ci -1+Ci (cr 1 -1) 


•cr 


1 - C, 

+£(**?)■ 

Cf - 1 


Cl - 1 


Q:i \j^ ■ k + /3i (k + k 2 ) + 7i k 


•cr 


Ci + l 


- ,,2 fr . ^1 x I ..2_ 

“ M (C!-l) 2+M (C 2 -l)(Cr-l) 


oii\j— ■ k + pi (k + k 2 ) + 7ifc 


•C 


s-l 

1 


M 2 fc 


cf -1 cf+cr 


Cl - 1 \ Cl - 1 


Co-1 


ai\ - + Pi (1 + A) +71 

7T 


□ 


Lemma 2. 


C (a; 2 ) — ci 2 + Pi (k + /c 2 ) + 7i + 2o;i/3i \^~k + 2cti7i^/““ + ‘^Pi'Jik. 


Proof. 


E(x 2 ) = E \e t \ + Pir]t + 7 i) 2 ) 

= E (ale 2 + Pfrjl + 7 2 + 2 aiPi \s t \ rjt + 2«i7i \e t \ + ZPi'nVt) 

= a\E (el) + PlE (rjl) + 7 i + “ZotiPiE (|et|) • E (rjt) + 2 ai 7 i E (|et|) + 2 PijiE (rjt.) 

= a 2 + p 2 


-k + 2ai7iW —h 2/3i7ifc. 


□ 


24 









AXP Volatilities 



Year 


WMT Volatilities 



Year 


Figure 8: Plots of the Int-GARCH estimated volatility versus RV for AXP and WMT stocks for the period 
1/2/2000 - 12/31/2001. 


AXP 5-min Intraday Price on 0311012000 WMTSmin Intraday Price on 04/0412000 




Figure 9: Plots of the 5-min intraday price for AXP stock on March 10, 2000 and WMT stock on April 4, 
2000, respectively. 
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